Optimisation problems and replica symmetry breaking 
in finite connectivity spin-glasses 
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A formalism capable of handling the first step of hierarchical replica symmetry breaking in 
■ finite-connectivity models is introduced. The emerging order parameter is claimed to be a proba- 

bility distribution over the space of field distributions (or, equivalently magnetisation distributions) 
inside the cluster of states. The approach is shown to coincide with the previous works in the 
O ' replica symmetric case and in the two limit cases m = 0, m = 1 where m is Parisi's break-point. 

As an application to the study of optimization problems, the ground-state properties of the ran- 
dom 3-Satisfiability problem are investigated and we present a first RSB solution improving replica 
' symmetric results. 

I. INTRODUCTION 



It is now commonly thought that spin-glasses may exhibit highly interesting and non trivial features already at 
the mean- field level [QJ^]. This statement stems from the extensive studies performed during the last twenty years 
on models with infinite connectivity (IC), especially on the celebrated Sherrington-Kirkpatrick (SK) model Q|. Such 
models strongly differ from realistic finite-dimensional systems on two points, that is their large connectivity and the 
G ■ absence of any geometrical underlying structure. As was realized some years ago, only the latter aspect is intrinsic 
to mean-field theory while the irrealistic nature of the connectivity may be cured P,p|-pX|]. Mean-field spin-glasses 
i with finite connectivity (FC) are of importance for at least two reasons. First, they are expected to share common 

properties with finite-dimensional physical systems that IC models cannot exhibit B . Secondly, it is now well-known 
. that optimisation or decision problems can rigorously be mapped onto the ground-state of spin- glass models with 
FC jf]]]. The understanding of complex optimisation problems |jL2| , which would be of practical use for algorithmic 
design, therefore requires the introduction of sophisticated techniques that were invented in the IC models context, 
e.g. replica symmetry breaking (RSB) As regard to their potential interest, the investigation of the spin-glass 
phase of FC models has attracted few attention in the past years ]^1C|] . This situation is probably mainly due to the 
■ technical difficulties arising in the analytical calculations, 
t^. ' In this paper, we show that the complexity of FC models with respect to IC ones manifests itself through the 
OS ■ emergence of new and richer order parameters (Section II). We expose how to compute the free-energy of FC models 
within the one-step RSB scheme in Section III. Section IV is devoted to the derivation of the saddle-point equation 
for the functional order parameter. Throughout the paper, we check that the present formalism gives back the known 
results for the RS theory of FC models and the RSB theory of IC models. Our approach can be applied to any Ising 
FC spin-glass and is therefore of particular relevance to optimization problems. As an illustration, we concentrate 
in Section V upon the so-called random 3-Satisfiability problem (3-SAT), of central importance in complexity theory 



JT 2] . [T3[j . Recent theoretical interpretations [g_4 15| of the numerical data accumulated so far on 3-SAT |l£|] speak indeed 
for the existence of a spin-glass phase, making 3-SAT a valuable testing ground for RSB calculations. A first RSB 
solution is found and shown to improve the RS result. 



H ' II- ORDER PARAMETER FOR FINITE-CONNECTIVITY MODELS 

A. Occupation densities and the multi-level gas picture 



Let 7i be a Hamiltonian depending on N Ising spins Si, i = l,...,N and on some quenched degrees of freedom. 
To compute the equilibrium properties, we resort to the replica method [Q. Once the system has been replicated n 
times and the disorder averaged out, we obtain an effective model of 2™-states spins Si = (Sj, Sf, . . . , S"). In the 
absence of any underlying geometry (e.g. lattice), the effective Hamitonian 7i e // is invariant under any relabelling 
of the sites. As a consequence, 7i e // depends upon the spins only through the 2 n occupation densities c(er) defined 
as the normalized fractions of sites i such that Si = a . This is the very meaning of a mean-field theory : the 
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effective Hamiltonian depends on the spins only through a finite set of global fields rather than an extensive set of 
local ones. For instance, the effective Hamiltonian of the SK model at inverse temperature (3 reads 




£ cfr) c(c? 2 ) (o^) 2 , (1) 
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up to O(l) irrelevant terms. 

In this picture, we may interpret any mean-field IC or FC replicated system as a 'gas' of N particles living on 2" 
interacting levels. Each level is labelled by a vector a comprised of n binary components a a — ±1 and is filled in with 
N.c(a) particles. Taking into account the entropic contribution coming from the combinatorial choices of the sites, 
we obtain the general expression for the n th moment of the partition function Z, 



Z*= \{dc(a)5 PTc^-l e-^W) ( 2 ) 




where the bar denotes the average over the disorder and the free-energy functional reads, to the largest order in N, 

H{c})=H eff ({c}) + ±Y / c(<j)lnc(<j) . (3) 

a 

In the thermodynamical limit, the occupation densities c(a) are determined through the 2™ saddle-point equations 
corresponding to the optimisation of the free energy functional J 7 , 

c(a) = A(n) exp (-/J (4) 

where A(n) is to be chosen to ensure the normalization of the c's. All equilibrium properties can then be computed. 

Let us briefly comment the pros and the cons of the above formalism with respect to the usual formulation which 
involves overlaps between spins belonging to different replicas. Expressions always require to compute 2 n order 
parameters c. In the case of IC models, most of them contain redundant information since only ^n(n — 1) distinct 
overlaps are required. As we shall see in the following, the structure of the order parameters of IC models is indeed 
very simple and does not account for the whole spectrum of mean-field disordered models. For FC systems, the present 
formalism proves to be much more tractable than the usual formulation from the analytical standpoint. It amounts 
to encode all overlaps in a concise way through the generating function c(a) Jl8| | . Another advantage of the present 
approach appears when focusing on Hamiltonians with p-spins interactions (p > 3) ||l9|| . The use of the occupation 
densities c's avoids the physically unclear introduction of Lagrange parameters, that are necessary to define overlaps 
even in the IC case. 



B. Replica symmetric Ansatz 

With a view to undertake RSB calculations, a short discussion of the replica symmetric (RS) Ansatz is illuminating. 
The RS theory of FC models was worked out ten years ago |5||| and may be reformulated within the Thouless- 
Anderson-Palmer (TAP) framework p(J. Inside the single RS state, the spins Si fluctuate around their Gibbs averages 
{Si}. All relevant information rest in the histogram p rs (h) of the effective fields h, = ± tanbT 1 ^)) @. The order 
parameter proves thus to be a function p rs , belonging to the space V of the probability distributions over real numbers. 
How are these results recovered in the present formalism ? RS corresponds to the invariance of the saddle-point c(a) 

n 

under any permutation of its components. In other words, c(<?) is a function of the magnetisation s = a a only. 

a=l 

Inserting the above Ansatz into the extremisation conditions of JF({c}), one easily finds back all thermodynamical 
quantities through the one-to-one correspondence 

r JL ( e Ph*° \ 

< & ) = J R dhp rs (h) [I ^ — Th - j , (5) 
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or, equivalently, 



f e tihs 

c(s)=J R dh Prs (h) {2coshf3h)n , (6) 

where the symbol J R stands for an integral over the whole real axis. Note that the integral over h in (^|) converges if 
the real part of the magnetisation s is smaller than n (in absolute value). Therefore, when the number of replicas n 
tends to zero, the order parameter c(s) may be analytically continued on the imaginary axis. This is precisely what 
one needs to go back to the field distribution 

Prs(h) = J ^ c(is/(3) e- ish (n - 0) . (7) 

Let us underline that a drastic simplification takes place in IC models. The field distribution p rs (and c) becomes 
Gaussian; it is fully described by a variance, the RS overlap q (|], and the functional nature of the order parameter 
is hidden. 



C. Replica symmetry broken Ansatz 

We now consider the first step of Parisi's hierarchical RSB scheme ||, regardless of the possible existence of any 
new RSB pattern in FC spin-glasses. Each replica a is labelled by a couple of integers (b, b) according to the number 
b of the block it belongs to (1 < b < n/m) and its position b inside this block (1 < b < m). Following De Dominicis 
and Mottishaw we see that c(a) is left unchanged under any permutation inside the blocks and depends only on 

m 

the block magnetisations = a'-- b ' b \ To guess the structure of the order parameter, we again resort to the TAP 

b=l 

approach. Consider spin S%. Due to the presence of numerous states in the single cluster, the thermal average (Si) 
fluctuates from state to state. Consequently, the effective field hi is distributed according to a function pi(h) S V. 
We must keep in mind that p t is in turn a random variable depending on the particular spin under consideration. 
The variety of p's may be taken into account by introducing their histogram, that is a functional V\p\. This is a 
normalized distribution over V : J v Vp V\p\ — 1 . We thus look for V such that 



n/m 



e f3hs b 



c(a) = j v V P V[ P ]Uj^p { h)^-^ ¥ (8) 

satisfies the saddle-point equations for T . The generic order parameter V for FC models appears to be much more 
complex than in the IC case. 

As a simple check of the above formalism, let us see how to find back the RS theory. In the latter case, there 
exists a unique state. The effective fields can therefore not fluctuate from 'state' to 'state' and pi is simply a Dirac 
distribution in hi. The fields hi fluctuate according to their distribution p rs . Defining 



Prsip] = / dhp rs (h) S 



it 



p{h) - 8{h - h) 



(9) 



we indeed find that the order parameter (||) simplifies to the RS expression (||,^]). In eq. (JsJ) , the symbol 5 denotes 
the Dirac functional, i.e. the product over all values of h is omitted for simplicity. It was first remarked by Wong 
and Sherrington (in a different formalism) |^] that a generalization of the above equation to the RSB case may be 
obtained by replacing the inner Dirac distribution in eq. (^|) with a function to be optimised over. This Ansatz may 
be correct if the distributions p with a non zero weight can be labelled by ( i.e. are not more 'numerous' than) real 
numbers. In the generic case, a full functional V is a priori needed. 

How can expression (If) be analytically continued to real n, ml We first define v(y) as the number of blocks b of 
magnetisations — y, with y = —m, —m + 2, . . . , m — 2, m. It is easy to check out on (^) that c(er) depends only on 
the set of f (y)'s, as expected from the invariance of the order parameter under permutations between blocks Q. The 
discrete nature of y is merely due to the integer value of m and can be omitted to define an analytical continuation 
of the order parameters. Consequently, v{y) may be any function on the range —m < y < m satisfying the constraint 

77 

dy v{y) = - - , (10) 

777 
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in the small n limit. Finally, the order parameter c becomes a functional over the set of all possible functions v and 
reads, from m), 



\v] = Vp V[p] exp ( / dy v{y) In 

JV \J-rn 



dh p{h)- 



2 cosh [3h) r 



(11) 



Notice that the RS case is recovered by injecting the order parameter (|9|) into (11); one recovers (|6|) with s = 
J™m v i.li) V- l n the generic RSB case, c[v] depends on the whole v function and not only on its first moment p4] 



III. EXPRESSION OF THE ONE-STEP FREE-ENERGY FUNCTIONAL 



A. Methods to compute the free-energy 

Two procedures can be followed to access the thermodynamical properties, depending on the starting point of the 
calculation. 

• From the saddle-point equation : We inject the expression of the order parameter ( p"l| ) into the saddle-point 
equation (^). The resulting equation for P[p] has to be solved. Then, one can compute the free-energy (||), pos- 
sibly using the saddle-point equation to simplify the calculation (especially the entropic term whose calculation 
is uneasy). 

• From the free-energy functional : Since we know that the saddle-point equation is closed within the one-step 
algebra, we may hrst compute the free-energy functional T given in (|^) restricted to one-step order parameters 
(|S|). Once the analytical continuation of T to real to, n(— ► 0) has been carried out, we obtain the saddle-point 
equation for P[p] by differentiating T with respect to the latter. 

While both methods lead to the same result, the second one has an important practical advantage. The (total) 
derivative of T with respect to any parameter is equal to the partial derivative, while this is not necessarily true in the 
first procedure. This is of little interest for most of the control parameters which appear in the effective Hamiltonian 
only (it is usually easy to compute the partial derivative of Tl e f / in (|^) and to perform the analytical continuation - 
see next paragraph). However, considerable simplifications arise when computing the derivative of J- with respect to 
to. In the following, we shall therefore adopt the second procedure. 



B. Energetic contribution 



According to the interpretation given in Section II, the effective hamiltonian 7i e // describes the interactions between 
the different levels a. If the levels interact K by K, the corresponding effective Hamiltonian typically reads 



n 



eff 



^ c(ai)c(a 2 ) . . . c(a K ) I(a%, a 2 , ■ . ■ , o K ) 



(12) 



<Tl ,<T2,...,<TJC 



where the interaction function / is invariant under global permutations of both replicas and levels labels. For all usual 
IC or FC models, the computation of l~teff within the Ansatz (||) as well as taking the limit n — > do not present 
any difficulty. 

As an illustration, we consider three examples : 



the SK model : inserting c(<?) (Q) in 7i e // ([l]), the trace over a can be straightforwardly carried out by writing 

Id 2 

2 

z=0 



(<7i.<7 2 ) 2 



2 dz 2 



exp U ^ of erg 



(13) 



We obtain 



n-iSK 



J3_£_ 
An dz 2 



V Pl V P2 V\p$P\p 2 \ 



dh\dh 2 



pi(hi) p 2 {h 2 ) (coshz + sinh z tanh (5h\ tanh/3/i; 



^(l- q 2 + m ( q 2 - q 2 )) 



(n - 0) 



(14) 
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where 



q = J VpV[p] (Jdhp(h)tajihj3h 



qi = / VpV[p] / dhp(h)(tanh/3hY 
Jv Jr 



(15) 



As expected for IC models, the functional nature of the order parameter is indeed drastically washed out since 
only the two moments qo , qi of V[p] are relevant. 

• the Viana-Bray model : this model was introduced as a FC version of the SK model The Viana-Bray 
Hamiltonian reads 



njVB _ " 



1 - 2_j c ^ 1 ) c ^ 2 ) cosh(/3 CT1.CT2) 



(16) 



where a denotes the mean connectivity per spin. We obtain 



1 



T-/VB 
H eff 



a 

[3m 



V Pl Vp 2 V[p 1 }V[p 2 }\n 



dh\dh,2 



Pi{h\) pi(}i2) (cosh/3 + sinh /? tanh/3/ii tanh/3ft,2) r 
which explicitly depends upon on the whole distribution V[p\. 



(17) 



• the Satisfiability problem : this FC model will be described in Section V. At this stage, we only recall its 
Hamiltonian based on K-spins interactions : 



H eff -~p lU 



K 



c(a!)...c(^) exp -/^n^ 1 ] 



a=l 1=1 



(18) 



where S[.; .) denotes the Kronecker function [jl4j and a is a positive real parameter. We obtain 

-Ulff = ~~ [ 'Dpi ■■■Vp K V[pi] . ..V[p K ]-ln f dhi... dh K 
n JJ (3 J v m [J R 



pi(hi) . . . p K {h K ) \ l + {e- -l) 



n,=i 2 cosh/3ft,j 



(19) 



Note the additional complexity of (|l9| ) with respect to ( |l7j ) due to the presence of multispins interactions. 



C. Entropic contribution 

The calculation of the model-independent entropic contribution 

S = — c(a) In c(a) 



(20) 



to the free-energy is far more complicated. Indeed, contrary to 7i e //, S is not a function of a finite number of the 
integer moments 



(21) 



of the order parameter. The computation of the entropy <S* is exposed in Appendix A. The final expression, in the 
small n limit 



5 



S = - \ VvVv exp 



1 



In 



dx 
2^ 



dy v(y)v(y) ) c[iu] lnc[w] 

m / 

dye~ lxy (2cosx) m expi>(y) 



(22) 



involves a double functional integrals over v and v functions, see Appendix A. The order parameter P enters expression 
(E2n through the c functional (O) as expected. 



D. Back to RS : the m=0 and m=l cases 



In order to be self-consistent, the above formalism has to ensure that the RS free-energy is found back when m = 
or m — 1. We shall now see that this is the case. 

We first look at the energetic part of T . Within the RS Ansatz, the order parameter P[p] simplifies to (^). It 
is a simple exercise to obtain the resulting expressions for the energy, see e.g. We observe that the same 

expressions are found back if we identify the RS field distribution p rs (h) with 

• Case m = : 

p (h)= f VpP[p]p{h) . (23) 
Jv 



More generally, the expansion of the one-step free-energy functional in powers of m coincides with an expansion 
in terms of the cumulants of P[p] ■ For instance, the first non trivial term (of order m) includes the two functions 
correlation 

r(M')= / Vp P\p] p{h) p(ti) (24) 
Jv 

which is diagonal in the RS assumption and shows off-diagonal contributions otherwise Jl7| . 
• Case m = 1 : 

Pl (h)= f VpP[p]S(h-H[p}) . (25) 
Jv 



where 



H[p] = ~ tanlT 1 ( f dh p(h) taahflh) . (26) 



R 



When m = 1, an exponential number of states with exponentially small weights contribute to the partition 
function. As a result, these states may be considered as effective micro-configurations and their cluster as a 
single effective 'state'. Equations (^5 26) express that the mean spin magnetisations in the latter are simply the 



averages of the spins magnetisations over all physical states (with vanishingly small weights). 

The validity of the above relationship between RS theory and the m = 0, 1 cases also holds for the entropic part of 
T ', see Appendix B. In addition, we show in Appendix B how to compute the path integrals in ( |22j ) in the RS scheme 
to obtain the RS entropy 

_ S rs = / =— e- iC,v ln(2cosh/3/>) c rs (iu) (1 - \nc rs {w)) . (27) 
n J R 2?r 



with 



c rs {iv)= / dh p rs {h) e wh . (28) 



R 



The field distribution p rs is obtained through the optimisation of both entropic and energetic contributions to the RS 
free-energy. This approach, that gives the same result as previous works in the RS framework, we shall now extend 
to the first step of RSB. 
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IV. SADDLE-POINT EQUATION FOR THE FUNCTIONAL ORDER PARAMETER 



We shall now obtain the saddle-point equation fulfilled by the order parameter P[p]. To do so, we differentiate both 
entropic and energetic parts of the free-energy functional. 



A. Differentation of the entropic part 



The entropy (|22| ) depends on the order parameter through c (h_l|) only. It results convenient to introduce the 
operator 



K\p,v\ = J Vv exp —i J dy v(y) (i>(y) —ln(J dhp(h)- 



2 cosh [3h) r 



In 



dx 



dye lxy (2 cos a;)" 1 expi>(y) 



(29) 



which depends on both functions p(h) and v(y). Using the above definition, we rewrite the derivative of the entropic 
contribution to the free-energy as 



1 dS 



n dP[p] 

which depends on the field distribution p through 



T>v JC[p, v\ lnc[w] 



(30) 



B. Differentation of the energetic part 

As emphasized in paragraph III. 2, the calculation of the (model-dependent) energetic part of the free-energy, and 
consequently of its derivative does not present any difficulty. It turns out that the latter may always be written under 
the convenient form 



1 dH eff 



T>v /C[p, v\ Cl[iv] 



n dP[p] 

The expressions of VL we now give for the different models of interest. 
• the SK model : 

n SK W] = ~ (m(l - gi)i* + go(^i) 2 + (1 - qi)^) 



where qo , qi have been defined in (ha) and 



dy u(y) y 3 



(31) 



(32) 



(33) 



• the Viana-Bray model : 

n VB \ 



2a 
T 



In 

A e = In 



/ Vpi V[pi] - V exp ( / dy u{y) 

J dhi pi(hi)exp Q (m(A+ + AJ) + a y{A+ - A-)) 

(e = ±l) . 



cosh/3(/ii + e) 



cosh (3 h i 



(34) 
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the Satisfiability problem : 

n SAT M = 



A 



K-l 



aK 



In 



In 



/ V Pl . . .T>p K -i V[pi] ■ ■ -V[pk-i] ^ V exp 

K-l , x 

/ IT d hj pj(hj)exp I - (m + ay)A K _ 1 
jR j=1 \* 



dy u(y) 



1 + (e-" - 1) 



n^Li 1 2 cosh /3 ft. 



(35) 



C. Self-consistency equation for P[p] 

Gathering derivatives (^0|) and ( |3l| ) together, we obtain the following saddle-point equation 

T>v K\p, v] (- \nc[w] - P0,[iv}) = A , (36) 

which has to be satisfied for any p(h). The Lagrange multiplier Ao in (H) is determined through the normalisation of 
P\p\. It does not depend upon p(h) and may be computed for e.g. p{h) = 5(h). Therefore, the saddle-point equation 
(pq) simply means that — lnc[w] is a zero mode of the operator 

Q[p,i>]=lC[p,v]-tC[6(h),v] . (37) 

The kernel of Q contains all constant functionals Ai due to the normalisation of p(h). Therefore, we end up with the 
following saddle-point equation 

lnc[v]+0fl[u] =Ai , (38) 

where Ai can be determined for e.g. v{y) — 0. From ( pd| ) and (|3^), we get Ai = /3f2[0], or equivalently 

c[v] = exp {-pO[v\ + pn[0}) . (39) 

We now have to find the v functions for which equation (|3^) has to be satisfied. P is a distribution of normalised 
probabilities p(h) merely by convention. Both P[p(h)\ and P[a(p) p(h)] must lead to the same physics independently 
of the irrelevant a factors. This is ensured if 



dy v(y) = 



(40) 



that is if condition ( |l0[ ) is satisfied. Consequently, P[p] has to be such that equation (|3^) is correct for any function 
v(y) over the range — m < y < m with zero integral vq (33). 



D. A simple application : infinite-connectivity models 



For the SK model, the self consistency equation (g9|) is obtained from (|ri],|3^) and reads 

oPhy 

Dp V[p] exp I / dy v(y) In 



dh p(h)- — 

R Hy '(2 cosh (3h) r 



exp 



Y (<?o(^i) 2 + (qi - 90)^2) 



for any v such that vq = 0. Solving the above equation, one finds the SK order parameter 

w qi - qo {h- h)(2 cosh I3h) m 



^SK 



[f>] 



dh w qo (h) 6 



p{h) 



J R dh'w qi - qo (h> - ft,) (2 cosh f3h') r 



(41) 



(42) 



where w a (z) = e~ z / 2a / \f2ira. In eq. (^2|), the symbol 5 denotes the Dirac functional, i.e. the product over all values 
of h is omitted for simplicity. As expected, eqs. ([l5]) are identical to the usual self-consistent equations for the RSB 
overlaps qo and qi, see eq. (III. 41) in ||. Notice that the expression of p in ( ]42] ) is in full agreement with the cavity 
derivation of the effective field distribution, see eq. (V.29) in B. 
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V. APPLICATION TO THE 3-SATISFIABILITY PROBLEM 



For generic FC models, the exact resolution of the saddle-point equation ( p9| ) appears to be extremely difficult. At 
zero temperature, the limit indeed of interest for optimization problems, some analytical simplifications take place. 
We shall now see on 3-SAT how quantitative results may be obtained this way. 



A. Presentation of the K-Satisfiability problem 

The satisfiability (SAT) problem is the paradigm of the class of hard (NP-complete) computational problem arising 
in complexity theory A pedagogical introduction to the K-SAT problem, a version of SAT and some of the 

current open issues in theoretical computer science may be found in |L3| . 

K-SAT is defined as follows. Let us consider N Boolean variables {xi — 0, l}i=i,...,jv. Choose randomly K among 
the N possible indices i and then, for each of them, a literal that is the corresponding Xi or its negation xi with equal 
probabilities one half. A clause C is the logical OR of the K previously chosen literals, that is C will be true (or 
satisfied) if and only if at least one literal is true. Next, repeat this process to obtain M independently chosen clauses 
\Ci }e=i,...,M and ask for all of them to be true at the same time (i.e. we take the logical AND of the M clauses). A 
logical assignment of the {xi}'s satisfying all clauses, if any, is called a solution of the K-satisfiability problem. 

For large instances (M,N — ► oo), K-SAT exhibits a striking threshold phenomenon as a function of the ratio 
a = M/N of the number of clauses per variable. Numerical simulations indicate that the probability of finding a 
solution falls abruptly from one down to zero when a crosses a critical value a c (K). Above a c (K), all clauses cannot 
be satisfied any longer. This scenario is rigorously established in the K = 2 case, where a e = 1 |2lj| . For K > 3, much 
less is known; K(> 3)-SAT belongs to the class of hard computational problems, roughly meaning that running times 
of search algorithms are thought to scale exponentially in N. Some upper and lower bounds on a c (K) have been 
derived |^2| and numerical simulations have recently allowed to find precise estimates of a c , e.g. a c (3) ~ 4.2 [l6| . 

In order to study the K-SAT problem, we map it onto a random diluted systems by introducing some spin variables 
Si = ±1 (a simple shift of the Boolean variables) and a quenched (unbiased) matrix Cg.i — 1 (respectively —1) if x% 
(resp. Xi) belongs to the clause Ci, otherwise. Then the energy-cost function 



A I 



i=i 



N 



Ci t i Si 



-K 



(43) 



equals the number of violated clauses and therefore its ground state (GS) properties describe the transition from the 
SAT phase {Eqs = 0) to the UNSAT phase (Eqs > 0). Note that a similar cost function was first introduced for 
neural networks by Gardner and Derrida J2j| . 

The effective Hamiltonian corresponding to the cost function ( f43| ) was given in (|l^) . Previous studies have shown 
that the RS theory was able to find back the critical threshold of the 2-SAT problem but became wrong for K(> 3)- 
SAT instances jl4|,[l5|]. In the following, we therefore concentrate upon the most interesting K — 3 case. We briefly 
recall the RS solution and then present a first RSB solution. 



B. Saddle-point equation and RS solution 

Following the above-mentioned procedure, the saddle-point equation for V reads 

IncH = 3a J V Pl Vp 2 V\pi\V\pi\ Q e*~ + \ e*+ - 1 

where (for e = ±1) 

$ f = 

J — m L J h 

' m + e y 



(44) 



dyv(y)\n< I dhidh 2 pi(hi)p 2 (h 2 ) x 
Ur 



x exp 



In 1 



-/J_l 



2 "'V' ' (1 + e- 2 ^)(l + e~^) _ 
Eq. ( fi4| ) has to be satisfied for all v of integral zero. The ground state properties are obtained by sending (3 — » oo. 
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Let us briefly recall the RS result p| . Inserting Ansatz (||) into (H), we find that the simplest RS solution includes 
half-integer fields only |,0Jl|,|| 

Prs (h)= J2 e^hi^sU-pi (45) 

l=-oo V lJ 

where Ig is the I th modified Besscl function and 7 is self-consistently determined through 

7 =^(l- e -^o(7)) 2 (46) 

In addition to the SAT phase solution 7 = Eqs = 0, there appears a metastable solution 7 > 0, Eqs above 
a — 4.67, see Fig. 1. This solution becomes thermodynamically stable at a = 5.18, well above the 'experimental' 
threshold a ~ 4.2. The latter is indeed thought to coincide with a first-order spin-glass transition ]T^Jl5| |. Moreover, 
the presence of 3-spins interactions in (|l8|) suggests that the one step RSB solution could be exact for 3-SAT and that 
m = 1 at the threshold |l{| . 



C. Simplest RSB solution 



We now turn to the RSB solution. In view of (|45|), we first restrict to distributions p's on half- integer fields h. 
Secondly, the cavity theory teaches us that p is biased in favor of large fields (see Section V.2 in Q for a clear 
explanation of this point). For IC models, large /i's indeed benefit from a Boltzmann factor e /3m '' 1 ' as can be seen on 
([f2|). We thus propose the following form for the distributions p with a non zero weight V[p] (for m = 1), 

p(h) = U {j3) £ p t exp (p^P) s(h-£) , (47) 

where w(f3) is a normalization factor and the p/s do not contain exponential terms in /?. Identity ([47]) is merely the 
simplest hypothesis compatible with the saddle-point equation; we shall come back to this point at the end of the 
letter. Sending the temperature to zero, we find that V\p] is a function of two variables only which are computed 
from p, namely the ends of its support £^ and £^ + \ This results from the zero temperature expression of the order 
parameter 



]im c[u/0\= V V ti - Ui+) e»- i ^+»+ iW , (48) 

£(-)<£(+) 



where V^-) equals the sum of the weights of the p functions having support Note that @ depends 

on v through 

v- = J dy v(y) y 

v+ = f dy v{y) y (49) 
Jo 

only, in agreement with the statement that p7| ) is the simplest non RS solution to (|44|) . After some algebra, fl SAT [v] 
( p5[ ) can also be shown to depend on V- , v + only. The self-consistent equation for V [p] simplifies into an implicit 
equation for the matrix T-^(-) £<+> which can be easily solved, 



e- A 



2) 2^+) _*)!(*_*(-))! • ^ 



As in the RS solution ([45]), 7 controls the decay of the probability weights of large fields. The new parameter A sets 
the width of the supports of the p's, that is the magnitude of the fluctuations of the effective fields from state to state. 
When A = 0, the supports of the p's shrink to single points and the RS solution ( [45|) is recovered, in agreement with 
(^). The two self-consistent equations on 7 and A as well as the expression of the corresponding GS energy Eqs are 
given in Appendix C. The results are displayed Fig. 1. In addition to the trivial solution 7 = A = Eqs — 0, there 
appears another solution for a > 4.45 with < 7 < A and a metastable, i.e. negative, GS energy. This solution 
becomes thermodynamically stable at a = 4.82. 
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VI. CONCLUSION 



The formalism we have presented in this article has permitted us to hnd an explicit RSB solution improving the 
RS saddle-point. Yet, the predicted threshold a c = 4.82 exceeds the 'experimental' value. This discrepancy may stem 
from the assumption (^). Contrary to the IC case, the payoff for large fields appearing in p might be a non-linear 
function of \h\. If so, an inspection of the saddle-point equation ( |i4| ) shows that h is not constrained to take half- 
integer values anymore. This refinement scheme is reminiscent of the iterative procedure used in the RS theory to 
improve the si mp lest solution (^). By increasing the resolution on the fields, the RS thresholds decreased from 5.18 
down to 4.60 II . 

However, the meaning of non half-integer fields is far from being clear. From a physical point of view, the effective 
fields should be half-integer valued at zero temperature. Previous studies performed for the Viana-Bray model have 
nevertheless shown that the half-integer field distribution is unstable with respect to longitudinal (within the RS 
sector) fluctuations ||. Then, we face the following dilemna. The RS saddle-point equation admits a physically 
sensible but unstable solution and many other ones, whose significances are dubious. In this context, the RSB 
solution presented in the last Section has a remarkable property. While the supports of the p's with non zero weights 
contain physical (i.e. half-integer) values only, the resulting effective distribution P(h) ( p5|) includes Dirac peaks on 
the integer multiples of a quarter! More precisely, the RSB ground state energy is equal to the ground state energy 
of the RS solution with fields which are integer multiples of a quarter p5| . Work is in progress to reach a better 
understanding of this puzzling coincidence p6| . 

Acknowledgments : I am particularly indebted to R. Zecchina for very fruitful contributions. I also thank A. Bal- 
dassarri, D. Dean, S. Franz and I. Kanter for interesting and motivating discussions. 



APPENDIX A: CALCULATION OF THE RSB ENTROPY 



Clearly, the analytical continuation of the C/s to real i in the vicinity of the unity will allow us to compute the 
entropy @), 



S ~ dl 



Ci 



(Al) 



For integer £, the moments Ci are easily expressed from (g) as Cg ~ 1 + nTi in the small n limit with 

Tt= f Vp 1 ...V Pl ,V[p 1 }...V[ Pe }-\n [ dh x ...dh t 
Jv m [J R 



pi{hx) . . .pe(he) 



2 cosh(/3 Y^j=i h>j 
n,=i 2 cosh (3hj 



(A2) 



In order to ac hie ve an analytical continuation of r^, we proceed in two steps. First, consider the argument A of the 
logarithm in QA2). Defining 



i r r 

t>(y;Pi, ■ ■ ■ ,Pi) = V^ln dhpj(h) 
3=1 Ur 



cPhy 



(2 cosh (3hy 



the latter may be rewritten as 

A[u(y;pi, ...,pe) 



e lxy (2 cosh x) m expP(iy;pi,...,pi) 

Z7T 

dx f m 

— / dy e lxy (2 cos x) m expv(y;p 1 ,...,p e ) 



(A3) 

(A4) 
(A5) 



where the last expression has been obtained t hrou gh the rotation (x,y) — > (ix, —iy). Note that the range of integration 
over y in ( A5) makes the integral over h in (A3) convergent. We now introduce the probability distribution Qe[g] of 
the functions g{y; p\, 



L) • • • 1 



Q t [i>} = ( Vp 1 ...Vp e V[p 1 }...V[p e }5[v(y)-v(y;p 1 ,...,p e )} 
Jv 



(A6) 
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and rewrite Tg as a functional integral over all possible g functions (with support [—to, to]) weighted with measure 

a. 



T e = fvvQi[u] - lnA[%)] 

J TO 



(A7) 



The second step of the calculation lies in the analytical continuatio n of the measure Qi to real i. This may be achieved 
through an exponential representation of the functional Dirac in (A6). We thus obtain 



d_ 

M 



Qe[i>] = / Vv exp —i I dyu{y)v{y) c[v]\a.c[v] 



(A8) 



where the functional c[u] is given in (|ll|). In equation (A8), the measure T>v ensures the correct normalisation of the 
Dirac functional, that is factors 1/2tt have to be included when discretising the path integral over v. The resulting 
expression of the entropy is given in 



APPENDIX B: RS ENTROPY AND THE M = 0,1 CASES 

In this Appendix, we first derive the RS expression of the entropy (^2|). We then show how the latter is recovered 
when m = or m = 1. 



1. Calculation of the RS entropy 

Consider the RS expression © for P[p]. The functional c[v] (0) simplifies to 

c(vq,v\) = / dh Prs(h) exp (— i i/q to ln(2 cosh /? ft,) + i v\ [3 h) 
Jr 



(Bl) 



It depends on v through the two moments isq, v\ only, see (|33|). We now introduce the series expansion of v{y) around 

y = o, 



3=0 



(B2) 



Combining the coefficients definitions (33) and (B2), we rewrite the coupling term between v and v in ( |22| ) as follows 

(B3) 



exp ^-i J dy v{y)v{y)^j = exp f-i^i>j fjj 



The integration over all coefficients Vj with j 1 > 2 gives Vj = 0, Vj > 2. Therefore, only the p's that are linear functions 
of their argument y survive. We obtain 



-S=- 

n JR 

i,n 

TO 



di/^dv^ duidvi 
2tt 2tt 



e~ lxv (2 cosh x) m exp(i>o + i>i iy) 

ZTT 



(B4) 



The argument of the logarithm in (B4) simply reads e va (2 coshi>i)™ 1 . The remaining integrals on i>o, vo, i>\, v\ may 
then be carried out and the final result is given in (E7h . 
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2. Case m=0 



We expand the logarithm in (pip and find 



I In 

m 



[ e~ lxy (2coshx) m equity) 

Jr 2n 



0(0) : f dydy 
2tt 



iyy m(2coshy) e'W-W + 0(m) 



(B5) 



in the small to limit. We now consider the first term on the r.h.s of (B5) and integrate out all modes 0(y 7^ 0) in (|2 
The resulting c[z/] reads, for small to, 



c[i/] = 1 + % m i/(0) / dhp {h) ln(2 cosh /%) + 0(m 2 ) , 
Jr 

where po has been defined in (pqV Integrating <?(0), i/(0) out, we obtain a first contribution to the entropy (|2 

- Si = [ dh Po (h) ln(2cosh/3/i) 



(B6) 



(B7) 



We now focus on the second part of the r.h.s of (B5). By integrating the O's out, we get u(y) — except v(iy) = 
—v(Q) = i- The corresponding functional (O) reads 



giving rise to the following entropy 



c[u] = c(y) = / dhp (h) 



dydy 



Jf3hy 



e lyy ln(2coshy) c(y) lnc(y) 



ir 2^ 

Summing up Si and S2, the RS entropy (|27|) is recovered with p rs = pq. 



(B8) 



(B9) 



3. Case m=l 



For m = 1, the argument of the logarithm in fl22j ) reads e v + + e v - , with v t = 0(ie), e — ±1. For all y 7^ ±1, the 
integration over 0(y) gives v(y) = 0. The entropy then reads 



1 



-S 



dv + dv + dv_dv_ 



ir 2tt 
where, using definition (pq), 



2tt 



c(^ + ,i/_)lnc(^ + ,i/_)e 4 ( p +' / + +i> -' / -) lnfe^ + e*" 



c(i/+, f_) = / dh pi(h) cxp (i (v + + i/_) ln(2 cosh/3/i) — i — i/_) /3/i) 
Jr 

We make the following unitary change of variables 

^0 = + v -) 

vo = v+ + v- 

h = \(P+ - v-) 

V X = V + -V- , 



(BIO) 



(Bll) 



(B12) 



and find back the RS expression (B4). Therefore, when to = 1, the RS entropy is recovered with p rs = p±. 
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APPENDIX C: SELF-CONSISTENCY EQUATIONS FOR 7, A AND GROUND-STATE ENERGY E 



The self-consistent equations for 7 and A read 



7 = 3a( 2~T~ ri 



A = 3a 



1 _ro 

2 2 



(CI) 
(C2) 



where 



ro= e^- A he(A) 

l=-OG 
1 OO 

ri = ~ E e ~ 7 " A/ *(7) [/ 2 ^i(A)+W(A)] 



(C3) 



To compute the ground state energy E, we insert the saddle-point solution for c(<?) into the free-energy (||) and 
keep the linear term in n only. We then send (5 — > 00 and find 



E=-a 
A 



l-r 



l-r 



3a 

T 



l-r 



4 (ro + ri) - -(r + r 2 + 2n) 

1 OO 

7-2 = 9 S e_7 ^ A [Jm- 2 (A) +/ W+2 (A)] 



(C4) 



Note that A = is always a solution of the self-consistent equation (C2); eq. (CI) then gives back the RS saddle-point 
constraint on 7 (^6|). In this case, the ground state energy E flC4j) simplifies to 

E RS = J(l-e-T r Io(7)-3e-T/i(7)) , (C5) 



in agreement with the findings of [14 15 1. 
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FIG. 1. Decay parameters 7 (thin curves-left side scale) and ground state energies E (bold curves-right side scale) for the 
RSB (full curves) and the RS (long-dashed curves) solutions as functions of the number of clauses per variables a. The RSB 
width parameter A is shown on the upper dotted curve. The vertical dashed lines indicate the spinodal points orsb = 4.45 
and ors = 4.67. 
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